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LOAD BALANCING METHOD FOR EXCHANGING DATA BETWEEN 
MULTIPLE HOSTS AND STORAGE ENTITIES, IN IP BASED STORAGE 
AREA NETWORK 

RELATED APPLICATIONS 

5 This application claims the benefit under 35 U.S.C 1 19(e) from U.S. Provisional 
Application No. 60/317,860 filed September 7, 2001. 

BACKGROUND OF THE INVENTION 

Field of Invention 

The invention relates to the load balancing of traffic in an IP based Storage Area 
10 Network (SAN). More particularly, the present invention relates to dynamic assignment 
of IP addresses to storage end points according to network traffic. 

Description of the Related Art 

The Small Computer Systems Interface ("SCSI") is a popular family of 
protocols for communicating with I/O devices, in particular storage devices. More to the 
15 point, SCSI is the basic protocol for I/O operations between computers and storage 
subsystems. The SCSI protocol is, in essence, a point-to-point protocol. 

Another popular protocol is the Transmission Control Protocol/Internet Protocol 
("TCP/IP"). TCP/IP is the basic set of communication protocols for general data 
exchange between computers connected on a communication network. This is a 
20 common protocol used to communicate via the Internet. 

Currently there is a convergence between the two protocols, that is, SCSI and 
TCP/IP. hi particular, computers that communicate with their subsystems via SCSI are 
now tending to be interconnected via the Internet and utilizing TCP/IP to communicate 
with each other. 

25 In view of this convergence, a standard (ietf draft) has been proposed for 

implementing SCSI over TCP. The currently proposed standard "draft-ietf-ips-/5C5T- 
13.txt" is available at http://www.ietf. org/intemet-drafts/draft-ietf-ipswycrf-l 3.txt 
(Satran. et. al IPS Internet Draft -ietf-ips-iscsi-15.txt [online], [retrieved on Sept. 5, 
2002]. Retrieved from the Internet < URL:http://w^ 

30 15.txt> (herein expressly incorporated by reference). 

1 
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The ietf draft aims to be fully compliant with the requirements laid out in the 
SCSI Architecture Model-2 Document (Project Leader Charles Monia , Technical 
Editor John P. Scheible Working Draft [online] [retrieved on Sept. 5, 2002]. Retrieved 
from the Internet <ftp://ffo.tl O.org/tl 0/drafts/s2p/s2p-r07b.pdf > also herein expressly 
5 incorporated by reference). This SCSI standard provides for transmitting SCSI 
commands between SCSI devices, over TCP/IP connections. Conventionally, SCSI 
devices that communicate with each other must be connected via the same SCSI bus. 
The proposed ietf draft permits SCSI devices, which are not connected via the same 
SCSI bus to communicate with each other via the Internet, utilizing the TCP/IP 
10 protocol. According to the ietf draft, SCSI devices that are separated from one another 
even by great distances may communicate with each other. The ietf draft seeks to 
describe an implementation of SCSI that operates on top of TCP as the transport 
protocol. 

The following briefly defines some SCSI conceptual terms. The end point of a 
15 typical SCSI command is a "logical unit 1 ' (LUN). Logical units include, for example, 
hard drives, tape drives, CD and DVD drives, printers and processors. A collection of 
logical units is referred to as a "target" and is directly addressable on the network. In a 
client-server model, the target corresponds to the server. An "initiator" creates and 
sends SCSI commands to the target. In the client-server model, the initiator corresponds 
20 to the client. 

A typical SCSI command results in a command phase, data phase and a response 
phase. In the data phase, information travels either from the initiator to the target (for 
example, a WRITE command), or from the target to the initiator (for example, a READ 
command). In the response phase, the target returns the final status of the operation, 
25 including any errors. A response signal, is the end of a typical SCSI command. 

The ietf draft defines a method of encapsulating those SCSI commands, SCSI 
Data and SCSI Responses over TCP/IP. It also defines methods to ensure the reliability 
of the transfer from the initiator to the target. This includes commands' numbering, 
commands' tagging and acknowledgment Also it provides means of securing data and 
30 authenticating initiators and targets to each other. 

A storage gateway is a device that connects IP networks and Storage Networks 
(e.g., parallel SCSI or Fibre Channel). On one side it connects to initiators via an IP 
network and on the other side it connects to targets comprising storage devices via for 
example, the storage bus. It receives encapsulated SCSI Commands and/or data 



WO 03/023640 



PCT/US02/28417 



from the initiators, encapsulates them and transfers the raw SCSI command/data to the 
storage device. After completion of the command, it sends back to the initiator a status 
signal and/or data over the IP network. 

A storage gateway can include more features than simple connectivity. Those 
5 can include Virilization, disk concatenation, striping, mirroring and so on. The 
exposed LUNs can be any function of the physical LUNs. The present invention is 
independent of those features. 

BRIEF SUMMARY OF THE INVENTION 

10 It is the intent of this invention to provide a method, system and/or means for 

balancing traffic on a plurality of storage entities in a storage area network. Some 
examples of storage entities include physical disks, virtual disks, clusters of physical 
disks, clusters of virtual disks and/or combinations of virtual and physical disks. 
Examples of a storage area network include an IP based network or a MAC layer based 

15 network. 

FIG. 1 is an illustration of a SAN over IP. The host servers (10) in the upper part 
are the initiators in the client-server model. Any number of host servers may be 
included, for example, as shown in FIG. 1 Host 1 to Host n. The storage endpoints (30) 
in the lower part are the target LUNs. FIG. 1 shows the LUNs as Storage 1 to Storage m 
20 as an example. However the LUNs may include, for example, hard drives, tape drives, 
CD and DVD drives, printers and processors. 

An intermediate layer is shown in FIG. 1 and may include a combination of, for 
example, Gateways (20). As illustrated in FIG. 1 any combination of Gateways (20) 
may be included. These are shown for convenience as Gateway 1 to Gateway k. The 
25 topology in this SAN over IP of FIG. 1 has any-to-any connectivity in the storage layer 
(35). The any-to-any connectivity can connect to any storage device. 

According to an embodiment of the invention, the mapping is set in the network 
by assigning an IP address of each storage entity. The IP address may be unique for 
each storage entity. 

30 According to another embodiment of the invention, each gateway listens on the 

EP addresses of the storage entities it is responsible for. There is no intersection of IP 
addresses between the gateways and each storage entity has one gateway that is 
responsible for it Thus a one-to-one routing map is implemented. 
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According to another embodiment of the invention, a statistical server 
accumulates and processes statistics of data traffic in the network from each host - 
storage entity route over a time interval. 

According to another embodiment of the invention, a statistical server is 
5 calculating the optimal routing map of each host - storage entity. The optimal routing 
map is the solution to an optimization problem, where each solution (mapping) results a 
cost function combined from the traffic variance (balance) and minimal changes from 
the previous mapping. 

According to another embodiment of the invention, the new routing map is set to 
10 the gateways. The server sets to each gateway the IP addresses it should listen via. 

The present invention proposes to take advantage of a unique addressing 
mechanism, to simplify the logic to a great extent reducing bottlenecks in the traffic and 
to balance traffic loads in the network. Thus, provide faster access and higher 
throughput from servers to the storage devices. Examples of a unique addressing 
15 mechanism may include IP, MAC but is not limited to these schemes. 



BRIEF DESCRIPTION OF THE DRAWINGS 

The above-mentioned and other advantages and features of the present invention 
will become more readily apparent from the following detailed description and the 
20 accompanying drawings, in which: 

Fig 1 is a scheme of a system topology. The scheme describes an IP network 
(25) with n host computers (10) and k gateways (20). The gateways (20) are connected 
with an any-to-any connection (35) to m storage entities (30); 

Fig 2 is an example of a particular routing. It shows the connectivity (240) from 
25 Host 2 (210) to storage entity 3 (235) via gateway 1 (220); 

Fig 3 is a scheme of a system topology with the addition of the statistics' server 
(360). The scheme describes an IP network (325) that includes n host computers (310) 
and k gateways (320). The gateways (320) are connected with an any-to-any connection 
to m storage entities (330); 



30 



Fig 4 is a scheme of another topology. The topology includes an IP network 
(425) connected with n host computers (410) and k gateways (420). The gateways 

4 
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15 



20 



25 



(420) are connected (455) to the storage entities (468) in a Fibre Channel network (465) 
routed by FC switch or FC hub (450); and 

Fig 5 is a scheme of a topology containing n host computers (510), A: proxies 
(520) and m storage endpoints with IP interface (570). 

DETAILED DESCRIPTION OF THE INVENTION 

The following detailed description of the preferred embodiment includes many 
specific details. The inclusion of such details is for the purpose of illustration only and 
should not be understood to limit the invention. 

FIG. 1 is an embodiment of n host servers (initiators) (10) connected to k 
Gateways (20) via IP network (25). The k gateways (20) are connected to m storage 
devices (30) with an any-to-any connectivity (35). Each host can reach each disk via 
each one of the gateways (20). 

For purposes of illustration for a particular embodiment, the TCP/IP backbone 
including routers could be part of the global Internet, an Intranet, or any network 
utilizing TCP/IP. It should be noted that the invention can be applied to any system 
utilizing SCSI buses which are interconnected via a TCP/IP protocol. Thus, the 
invention could easily be implemented in a variety of network architectures. 

In this example a configuration will be used in the following discussion of the 
method and system according to the invention. Reference is made to FIG. 1, in 
connection with describing the behavior of an example network. 

The goal of the invention is to create a routine mapping from each host server 
(10) to each disk (30). By creating this mapping the gateway (20) is set through which 
each host computer (10) connects to each disk (30), such that the traffic in the gateways 
(20) on the storage sub-network side (20) (35) (30) is balanced. 

A mapping of each host computer (10) to each disk (30) is of the form of a 

matrix: 



r host x 
host 2 
M 



map {disk t disk 2 A disk m ), 



r Gateway lA Gateway hl A 
Gateway 2A Gateway 2>2 A 




M M O 

jGateway^ Gateway^ A 



M 

Gateway t 
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where 

Gateway i j e {gateway l9 K , gateway J 
5 is a gateway that connects from hostj to diskj • 



10 



15 



The traffic tf from each host computer to a disk is measured in a time interval, is 
assembled. This assembling may be performed by a specific statistic server (350) as 
shown in FIG. 3 or a priory and entered into a matrix of the form: 



traffic 



(Itost^ 
host 2 
M 

(traffic^ traffic A traffic) 
traffic^ traffic^ A traffic^ 

M MOM 
{traffic^ traffic^ A fraffic^) 

Where traffic y is the amount of data transferred between host, and disk ft the time 

interval. 



Gtraffic = 



(disk t disk r A disk m ), 



f Gateway^ 
Gateway 2 
U 

jGateway k 

(Gtraffic ltl Gtraffic^ A Gtraffic^ 
Gtrafflc u Gtraffic^ A Gtraffic^ 

M MOM 
Gtrqffic ljt Gtrqffic kt2 A Girqffic ktm> 



where G/r<#c fJ is the traffic from gateway and disk } such that: 



20 



Gtrafficjj a Gtrajpc(pateway t ,diskj)= ^traffic^ 



kis: 



GdtewayTraffic = 



r Gateffic^ 
Gateffic 2 
M 

\Gateffic k ) 



(m \ 

^trafficy 
t-i 

m 

Y* Gtra fficu 

m: 
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The optimal mapping is a map matrix that will minimize the cost function: 

inin(w w • vzt(GatewayIYqffic)+ w £kattge • change) 

where var is the variance, change is the number of changes from the previous 
mapping and w yar and w c hange are the weights assigned to the variance and changes 
5 respectively. 

The solution for the optimization problem assigns each route from host i (10) to 
storage device j (30) Gateway* j to a physical gateway / (20), i.e 

Gatewayij ->gateway u \< i<j < m,l <l<k 

The possible mappings combinations are k!"*" , thus the complexity of the 
10 optimization problem is 0(map)= A/ 71 *" . 

where k is the number of gateways (20), m is the number of disks (30) and n is the 
number of host computers (10). 

In networks, where 0(map) is computationally feasible, all alternatives are 
checked (cost function is calculated) and the best mapping is selected. The feasibility 
15 may be related to the size of the network or the availability of computational power, for 
example. In addition to computing all alternatives, heuristic algorithms can be applied 
such as simulated annealing, greedy algorithms, economic optimizations, genetic 
algorithms or tabular search. 

If a Storage Name Server (SNS) resides in the network, it contains the mapping. 
20 Each host computer (10) queries the IP Network periodically for the gateway (20) that 
connects it to the desired disk (30). 

Each gateway (20) listens on the corresponding IP addresses of the routes it is 
assigned to. That is, if host (10) i (l<=/<=«) is routed to storage entity (30) j (l<=j<=m) 
via gateway (20) / (l=/<=£), gateway / listens on the IP address of storage entity / Thus 
25 the mapping is implemented transparently for the host computers (10) and the disk (30). 

In an embodiment of this method and system for balancing traffic on a plurality 
of host storage entities with selectively optimized routing comprising physical disks, 
virtual disks, clusters of physical disks, clusters of virtual disks as well as combinations 
of virtual and physical disks in an IP based storage area network includes constructing a 
30 network that has at least one host of several hosts, at least one of several of a storage 
entities, and at least one of several storage gateways; 
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In the embodiment a unique gateway is selected for connecting between the host 
entity to the storage entity. Routing occurs via a host- gateway -storage entity. Further, 
assigning an unique IP address to each of at least one of several storage entities such 
that said host recognizes a storage entity by IP address. Then accumulating at least one 
5 of several of a statistical reading of data traffic in said network from each host-gateway- 
storage entity route over a time interval by a statistical server. The statistical server 
may process the statistical reading. This may be utilized in calculating an optimal 
routing map of each of the host gateway-storage-entity. 

The optimal routing map includes a an optimization function where each of at 
10 least one of several mapping solutions results from a cost function combined from a 
traffic variance balance and a reading of minimal changes for an a priori mapping; 

Further determining at least one of several of a mapping setting in the network 
by assigning the unique IP address of each host-gateway-storage entity to a storage 
gateway with each of one of several gateways listening for new IP addresses set to each 
15 gateway; and implementing an optimized routing mapping. 

Also in another embodiment the accumulating of statistics is done a-priory by at 
least a network manager. 

In a further embodiment the IP addresses assignment is implemented with a 
Storage Name Server configuration. Also the assignments may be determined by a host 
20 querying the server periodically for the IP address of the storage entity and granting the 
IP address according to the routing map. 

In a further embodiment the routing map is implemented by the storage 
gateways with redirections of the storage transaction commands. 

In a further embodiment the network is constructed from at least one of several 
25 hosts, several proxies and several IP storage endpoints comprised of at least one of a 
storage entity with IP interface or a storage gateway. 

The optimal routing may also comprise an optimization determined by a fitness 
function including at least one of a genetic algorithm; a simulated annealing or a tabular 
search. 

30 Fig 4 shows an embodiment of another topology. An IP network (425) is 

connected with n host computers (410) and k gateways (420). The gateways (420) are 
connected (455) to the storage entities (468) in a Fibre Channel network (465) routed by 
FC switch or FC hub (450). The storage entities (468) may include, for example, 
physical disks, virtual disks, clusters of physical disks, clusters of virtual disks as 
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well as combinations of virtual and physical disks. The system implements selectively 
optimized routing. 

This system is constructed to include at least one of several gateways (420); at 
least one of several storage databases; an IP network (425); at least one of several 

5 statistical servers (450); at least one of several FC switches or hubs (450) in any 
combination. Also included as shown in FIG. 5 are A; proxies (520), thus the 
configuration may also include at least one of several proxies; and an interconnection 
optimally routed to at least one of several storage databases (468 FIG. 4). The storage 
databases may further include IP enabled storage databases (570 FIG. 5). 

10 In another embodiment a system for balancing traffic on a plurality of host 

storage entities with selectively optimized routing in an IP based storage area network 
also includes any combination of physical disks, virtual disks, clusters of physical disks, 
clusters of virtual disks as well as combinations of virtual and physical disks. 

This system further includes at least one of several hosts linking to an IP 

15 network, a statistical service, at least one of several gateways, interconnecting at least 
one of at least one FC switch an interconnection to optimally routed to at least one 
served of several storage databases. 

In this embodiment the storage databases may further include IP enabled storage 
databases; an IP network; at least one of several statistical servers; at least one of 

20 several FC switches; at least one of several FC hubs; at least one of several proxies; 

and an interconnection optimally routed to at least one of several storage databases; said 
storage databases further comprising IP enabled storage databases. 

While this invention has been described in conjunction with the specific 
embodiments outlined above, many alternatives, modifications and variations will be 

25 apparent to those skilled in the art. Accordingly, the preferred embodiments of the 
invention is set forth above are intended to be illustrative, and not limiting. Various 
changes may be made without parting from the spirit and the scope of the invention as 
defined in the following claims. 



9 
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What is claimed is: 

1 . A method for balancing traffic on a plurality of host storage entities with 
selectively optimized routing in an storage area network, including a network of at least 
one host and at least on storage gateway, said method comprising the steps of: 

5 selecting a unique gateway for connecting between a host entity to a 

storage entity; 

routing via a host - gateway - storage entity; 

assigning an IP address to at least one of several storage entities such that said 
host recognizes a storage entity by IP address; 
10 accumulating at least one of several of a statistical readings of data traffic in said 

network from each host-gateway-storage entity route over a time interval by a 
statistical server; 

processing said statistical reading; 

calculating an optimal routing map of each of said host gateway-storage-entity 
15 said optimal routing map including a function of an optimization function where each 
of at least one of several mapping solutions results from a cost function combined from 
a traffic variance balance and a reading of minimal changes for an apriori mapping; 
determining at least one of several of a mapping setting in said network by assigning 
said IP address of each host-gateway-storage entity to a storage gateway with each of 
20 one of several gateways listening for new IP addresses set to each gateway; and 
implementing an optimized routing mapping. 

2. A method as recited in claim 1, wherein the assigning step assigns a unique 
IP address. 

3 . A method as recited in claim 1 , wherein the processing step is performed by a 
25 statistical server. 

4. A method as recited in claim 1, wherein the step of accumulating of statistics 
in the method of claim 1 is done a-priory by at least a network manager. 

5. A method as recited in claim 1, wherein the IP addresses assignment is 
implemented with a Storage Name Server configuration therein further said assignments 

30 is determined by the step of : 

a host querying the server periodically for the IP address of the storage entity 
and granting the IP address according to the routing map. 

10 
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6. A method as recited in claim 1, wherein the routing map is implemented by 
the storage gateways with redirections of the storage transaction commands. 

7. A method as recited in claim 1, wherein the network is constructed from at 
least one of several hosts, several proxies and several IP storage endpoints including at 

5 least one of a storage entity with IP interface or a storage gateway. 

8. A method as recited in claim 1, wherein said optimal routing further includes 
an optimization determined by a fitness function having at least one of a genetic 
algorithm; a stochastic simulation; a simulated annealing; a tabular search; an economic 
optimization, wherein the economic optimization further includes at least one of a cost 

10 minimization; a substitution based on a selection of a cluster of at least one of several of 
a host; at least one of several of a gateway; and at least one of several of a storage 
entity. 

9. A method as recited in claim 1, wherein the host storage entity includes at 
least one of a host storage entity having at least one of several of physical disks, virtual 

15 disks, cluster of physical disks, cluster of virtual disks or cluster of virtual and physical 
disks. 

10. A system for balancing traffic on a plurality of host storage entities with 
selectively optimized routing comprising physical disks, virtual disks, clusters of 
physical disks, clusters of virtual disks as well as combinations of virtual and physical 

20 disks in an IP based storage area network; 

at least one of several gateways; 

at least one of several storage databases; 

an IP network; 

at least one of several statistical servers; 
25 at least one of several FC switches; 

at least one of several FC hubs; 

at least one of several proxies; and 

an interconnection optimally routed to; 

at least one of several storage databases; 
30 said storage databases further comprising IP enabled storage databases. 

11. A system for balancing traffic on a plurality of host storage entities with 
selectively optimized routing comprising physical disks, virtual disks, clusters of 
physical disks, clusters of virtual disks as well as combinations of virtual and physical 
disks in an IP based storage area network; further consisting of 
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at least one of several hosts linking to an; 

an IP network; 

a statistical service; 

at least one of several gateways interconnecting to 
5 at least one of a FC switch; 

an interconnection optimally routed to 

at least one served of several storage databases; 

said storage databases further comprising IP enabled storage databases. 

an IP network; 
10 at least one of several statistical servers; 

at least one of several FC switches; 

at least one of several FC hubs; 

at least one of several proxies; and 

an interconnection optimally routed to 
15 at least one of several storage databases; 

said storage databases further comprising IP enabled storage databases. 



12 



WO 03/023640 



PCT/US02/28417 




WO 03/023640 



PCT/US02/28417 



2/5 




WO 03/023640 



PCT/US02/28417 




WO 03/023640 



PCT/US02/28417 




WO 03/023640 



PCT/US02/28417 




